Towards a unified approach
"Decision-making in the presence of uncertainty is a pervasive computation.
Latent variable decoding—inferring hidden causes underlying visible
effects—is commonly observed in nature, and it is an unsolved challenge
in modern machine learning.
On many occasions, animals need to base their choices on uncertain
evidence; for instance, when deciding whether to approach or avoid an
obscured visual stimulus that could be either prey or a predator. Yet,
their strategies are, in general, poorly understood.
In simple cases, these problems admit an optimal, explicit solution.
However, in more complex real-life scenarios, it is difficult to determine the
best possible behavior. The most common approach in modern machine
learning relies on artificial neural networks—black boxes that map each
input to an output. This input-output mapping depends on a large number
of parameters, the weights of the synaptic connections, which are optimized
during learning. (...)
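For the simple, tractable case mentioned in the abstract, the optimal behaviour can be written down explicitly as a Bayesian decision rule. The sketch below illustrates this with invented priors, likelihoods, and payoffs; none of these numbers come from the paper.

```python
import numpy as np

# Hypothetical two-hypothesis problem: the blurred stimulus is either prey or a predator.
# Priors, likelihoods, and payoffs are made-up numbers, chosen only to illustrate the computation.
prior = {"prey": 0.7, "predator": 0.3}

def likelihood(x, cause):
    # Likelihood of a noisy scalar visual feature under each hypothesis, modelled as a Gaussian.
    mu = {"prey": -1.0, "predator": +1.0}[cause]
    return np.exp(-0.5 * (x - mu) ** 2) / np.sqrt(2 * np.pi)

def posterior(x):
    unnorm = {c: prior[c] * likelihood(x, c) for c in prior}
    z = sum(unnorm.values())
    return {c: p / z for c, p in unnorm.items()}

# Payoffs (again invented): approaching prey is rewarded, approaching a predator is very costly.
payoff = {("approach", "prey"): +1.0, ("approach", "predator"): -10.0,
          ("avoid", "prey"): 0.0, ("avoid", "predator"): 0.0}

def optimal_action(x):
    post = posterior(x)
    value = {a: sum(post[c] * payoff[(a, c)] for c in post) for a in ("approach", "avoid")}
    return max(value, key=value.get), value

print(optimal_action(-0.5))  # an ambiguous stimulus leaning towards "prey"
```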
Beyond topological persistence: Starting from networks
Persistent homology enables fast and computable comparison of topological
objects. However, it is naturally limited to the analysis of topological
spaces. We extend the theory of persistence by guaranteeing robustness and
computability for significant data types such as simple graphs and quivers. We focus
on categorical persistence functions that allow us to study in full generality
strong kinds of connectedness, such as clique communities and k-vertex and
k-edge connectedness, directly on simple graphs and monic coherent categories.
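As a rough illustration of a persistence-style function defined directly on graphs (this is not code from the paper), the sketch below tracks how many 3-clique communities exist at each level of an edge-weight filtration, using networkx's k_clique_communities as the invariant.

```python
import networkx as nx
from networkx.algorithms.community import k_clique_communities

# Toy weighted graph; the weights define a filtration: at level t we keep edges with weight <= t.
edges = [("a", "b", 0.1), ("b", "c", 0.2), ("a", "c", 0.3),
         ("c", "d", 0.5), ("d", "e", 0.6), ("e", "f", 0.7), ("d", "f", 0.8)]
G = nx.Graph()
G.add_weighted_edges_from(edges)

def sublevel_graph(G, t):
    H = nx.Graph()
    H.add_nodes_from(G.nodes)
    H.add_edges_from((u, v) for u, v, w in G.edges(data="weight") if w <= t)
    return H

# Persistence-style function: number of 3-clique communities at each filtration level.
for t in [0.2, 0.4, 0.6, 0.9]:
    communities = list(k_clique_communities(sublevel_graph(G, t), 3))
    print(t, len(communities), [sorted(c) for c in communities])
```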
Parametric machines: a fresh approach to architecture search
Using tools from category theory, we provide a framework where artificial
neural networks, and their architectures, can be formally described. We first
define the notion of machine in a general categorical context, and show how
simple machines can be combined into more complex ones. We explore finite- and
infinite-depth machines, which generalize neural networks and neural ordinary
differential equations. Borrowing ideas from functional analysis and kernel
methods, we build complete, normed, infinite-dimensional spaces of machines,
and discuss how to find optimal architectures and parameters -- within those
spaces -- to solve a given computational problem. In our numerical experiments,
these kernel-inspired networks can outperform classical neural networks when
the training dataset is small.
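The construction in the paper is category-theoretic and cannot be compressed into a few lines; the deliberately simplified, hypothetical analogue below only illustrates the small-data kernel intuition (an RBF feature map whose coefficients are fit by ridge regression), not the machines themselves.

```python
import numpy as np

rng = np.random.default_rng(0)

# Tiny 1-D regression task with few training points, the regime in which the
# abstract reports kernel-inspired networks helping.
x_train = rng.uniform(-3, 3, size=12)
y_train = np.sin(x_train) + 0.1 * rng.normal(size=12)

def rbf(x, centers, gamma=1.0):
    # Gaussian kernel features between points x and the training centers.
    return np.exp(-gamma * (x[:, None] - centers[None, :]) ** 2)

# The "architecture" is a point in the span of kernel features at the training
# inputs; the parameters are the coefficients alpha, found by ridge regression.
K = rbf(x_train, x_train)
alpha = np.linalg.solve(K + 1e-3 * np.eye(len(x_train)), y_train)

x_test = np.linspace(-3, 3, 5)
y_pred = rbf(x_test, x_train) @ alpha
print(np.round(y_pred, 3))
```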
Persistence-based operators in machine learning
Artificial neural networks can learn complex, salient data features to
achieve a given task. On the opposite end of the spectrum, mathematically
grounded methods such as topological data analysis allow users to design
analysis pipelines fully aware of data constraints and symmetries. We introduce
a class of persistence-based neural network layers. Persistence-based layers
allow the users to easily inject knowledge about symmetries (equivariance)
respected by the data, are equipped with learnable weights, and can be composed
with state-of-the-art neural architectures.
Dependent Optics
A wide variety of bidirectional data accessors, ranging from mixed optics to
functor lenses, can be formalized within a single framework: dependent optics.
Starting from two indexed categories, which encode what maps are allowed in the
forward and backward directions, we define a category of dependent optics and
establish under what assumptions it has coproducts. Different choices of
indexed categories correspond to different families of optics: we discuss
dependent lenses and prisms, as well as closed dependent optics. We introduce
the notion of Tambara representation and use it to classify contravariant
functors from the category of optics, thus generalizing the profunctor encoding
of optics to the dependent case.
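Dependent optics generalize ordinary lenses. Purely as a point of intuition, and not as anything from the paper, the sketch below implements the plain, non-dependent lens special case in Python: a forward view and a backward update that compose.

```python
from dataclasses import dataclass, replace
from typing import Callable, Generic, TypeVar

S = TypeVar("S")
A = TypeVar("A")

@dataclass
class Lens(Generic[S, A]):
    view: Callable[[S], A]        # forward direction: extract the focus from the whole
    update: Callable[[S, A], S]   # backward direction: write a new focus back into the whole

    def compose(self, other: "Lens") -> "Lens":
        # self : Lens[S, A], other : Lens[A, B]  ->  Lens[S, B]
        return Lens(
            view=lambda s: other.view(self.view(s)),
            update=lambda s, b: self.update(s, other.update(self.view(s), b)),
        )

@dataclass
class Address:
    city: str

@dataclass
class Person:
    name: str
    address: Address

address_lens = Lens(view=lambda p: p.address,
                    update=lambda p, a: replace(p, address=a))
city_lens = Lens(view=lambda a: a.city,
                 update=lambda a, c: replace(a, city=c))

person_city = address_lens.compose(city_lens)
p = Person("Ada", Address("London"))
print(person_city.view(p))             # forward pass
print(person_city.update(p, "Turin"))  # backward pass
```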
Machines of Finite Depth: Towards a Formalization of Neural Networks
We provide a unifying framework where artificial neural networks and their architectures can be formally described as particular cases of a general mathematical construction---machines of finite depth. Unlike neural networks, machines have a precise definition, from which several properties follow naturally. Machines of finite depth are modular (they can be combined), efficiently computable, and differentiable. The backward pass of a machine is again a machine and can be computed without overhead using the same procedure as the forward pass. We prove this statement theoretically and practically via a unified implementation that generalizes several classical architectures---dense, convolutional, and recurrent neural networks with a rich shortcut structure---and their respective backpropagation rules.
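A minimal sketch of the finite-depth idea, under simplifying assumptions (a hand-built triangular endofunction on a concatenated state; the slot sizes and weights below are invented): the forward pass is obtained by iterating the endofunction a fixed, finite number of times.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy "machine of finite depth": the global state x concatenates three slots;
# slot 0 holds the input, and slot i > 0 reads only slots < i.
sizes = [3, 4, 2]
offsets = np.concatenate(([0], np.cumsum(sizes)))
W = [rng.normal(scale=0.5, size=(sizes[i], offsets[i])) for i in range(1, len(sizes))]

def f(x):
    """The machine itself: an endofunction of the global state."""
    out = np.zeros_like(x)
    for i in range(1, len(sizes)):
        seen = x[: offsets[i]]                       # earlier slots only
        out[offsets[i]: offsets[i + 1]] = np.tanh(W[i - 1] @ seen)
    return out

def forward(u):
    """Forward pass: iterate x <- x0 + f(x); the triangular structure makes
    the iteration stabilise after finitely many steps."""
    x0 = np.zeros(offsets[-1])
    x0[: sizes[0]] = u                               # inject the input into slot 0
    x = x0
    for _ in range(len(sizes)):
        x = x0 + f(x)
    return x

print(np.round(forward(rng.normal(size=3)), 3))
```

In this toy picture, the abstract's claim that the backward pass is again a machine corresponds to the fact that the transposed linearisation of f has the same triangular structure and can be iterated in exactly the same way.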
Generalized Persistence for Equivariant Operators in Machine Learning
Artificial neural networks can learn complex, salient data features to achieve a given task. On the opposite end of the spectrum, mathematically grounded methods such as topological data analysis allow users to design analysis pipelines fully aware of data constraints and symmetries. We introduce an original class of neural network layers based on a generalization of topological persistence. The proposed persistence-based layers allow the users to encode specific data properties (e.g., equivariance) easily. Additionally, these layers can be trained through standard optimization procedures (backpropagation) and composed with classical layers. We test the performance of generalized persistence-based layers as pooling operators in convolutional neural networks for image classification on the MNIST, Fashion-MNIST and CIFAR-10 datasets.
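As a rough illustration of the pooling idea (not the layers defined in the paper), the sketch below uses the gudhi library to compute sublevel-set persistence of an image patch and reduce it to a single pooled value; the result depends only on the sublevel-set filtration of the patch, which is the kind of structural awareness these layers exploit.

```python
import numpy as np
import gudhi

rng = np.random.default_rng(0)
patch = rng.random((8, 8))  # stand-in for a feature-map patch

# Sublevel-set persistence of the patch, seen as a cubical complex.
cc = gudhi.CubicalComplex(top_dimensional_cells=patch)
diagram = cc.persistence()

# One possible pooled feature: total 0-dimensional persistence (sum of finite bar lengths).
total_persistence = sum(
    death - birth
    for dim, (birth, death) in diagram
    if dim == 0 and np.isfinite(death)
)
print(total_persistence)
```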
Learning to represent signals spike by spike
Networks based on coordinated spike coding can encode information with high efficiency in the spike trains of individual neurons. These networks exhibit single-neuron variability and tuning curves as typically observed in cortex, but paradoxically coincide with a precise, non-redundant spike-based population code. However, it has remained unclear whether the specific synaptic connectivities required in these networks can be learnt with local learning rules. Here, we show how to learn the required architecture. Using coding efficiency as an objective, we derive spike-timing-dependent learning rules for a recurrent neural network, and we provide exact solutions for the networks' convergence to an optimal state. As a result, we deduce an entire network from its input distribution and a firing cost. After learning, basic biophysical quantities such as voltages, firing thresholds, excitation, inhibition, or spikes acquire precise functional interpretations.
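A minimal predictive spike-coding simulation in the spirit of this line of work, with fixed, hand-chosen decoding weights; the learning rules derived in the paper are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(2)

n_neurons, dt, steps = 20, 1e-3, 2000
D = 0.1 * rng.normal(size=n_neurons)      # fixed decoding weight of each neuron
thresh = 0.5 * D**2                       # spike only if it reduces the coding error
lam = 10.0                                # leak of the filtered spike trains (1/s)

x = np.sin(2 * np.pi * np.arange(steps) * dt)   # 1-D signal to encode
r = np.zeros(n_neurons)                   # filtered spike trains
x_hat = np.zeros(steps)
n_spikes = 0

for t in range(steps):
    err = x[t] - D @ r                    # current coding error
    V = D * err                           # "voltage": projection of the error onto each weight
    i = int(np.argmax(V - thresh))
    if V[i] > thresh[i]:                  # greedy rule: at most one spike per step
        r[i] += 1.0
        n_spikes += 1
    r -= dt * lam * r
    x_hat[t] = D @ r

print("reconstruction MSE:", np.mean((x - x_hat) ** 2), "spikes:", n_spikes)
```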